Beyond Poisson: Modeling Inter-Arrival Time of Requests in a Datacenter

نویسندگان

  • Da-Cheng Juan
  • Lei Li
  • Huan-Kai Peng
  • Diana Marculescu
  • Christos Faloutsos
چکیده

How frequently are computer jobs submitted to an industrial-scale datacenter? We investigate the trace that contains job requests and execution collected in one of large-scale industrial datacenters, which spans near half of a Terabyte. In this paper, we discover and explain two surprising patterns with respect to the inter-arrival time (IAT) of job requests: (a) multiple periodicities and (b) multi-level bundling effects. Specifically, we propose a novel generative process, Hierarchical Bundling Model (HIBM), for modeling the data. HIBM is able to mimic multiple components in the distribution of IAT, and to simulate job requests with the same statistical properties as in the real data. We also provide a systematic approach to estimate the parameters of HIBM.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Beyond Poisson: Modeling Inter-Arrival Times of Requests in a Datacenter

How frequently are computer jobs submitted to an industrial-scale datacenter? We investigate the trace that contains job requests and execution collected in one of large-scale industrial datacenters, which spans near half of a Terabyte. In this paper, we discover and explain two surprising patterns with respect to the inter-arrival time (IAT) of job requests: (a) multiple periodicities and (b) ...

متن کامل

Distributed Data Transaction of an Apache Web Server using Bulk Service Rule

The main theme of this paper is to find the Distributed Data Transaction of an Apache web server using bulk service rule. We obtain the parameter of service rate, Arrival rate, Expected waiting time and Expected Busy period. The inter arrival and inter service of HTTP request is assumed to Poisson Distribution Process (PDP) and these events are considered in the server for process sharing. The ...

متن کامل

Modeling and Optimization of Straggling Mappers

MapReduce framework is widely used to parallelize batch jobs since it exploits a high degree of multi-tasking to process them. However, it has been observed that when the number of mappers increases, the map phase can take much longer than expected. This paper analytically shows that stochastic behavior of mapper nodes has a negative effect on the completion time of a MapReduce job, and continu...

متن کامل

Towards a carrier SDN: an example for elastic inter-datacenter connectivity.

We propose a network-driven transfer mode for cloud operations in a step towards a carrier SDN. Inter-datacenter connectivity is requested in terms of volume of data and completion time. The SDN controller translates and forwards requests to an ABNO controller in charge of a flexgrid network.

متن کامل

Statistical Description and Analysis of the Concurrent Data Transmission from Massive MTC Devices

The concurrent data transmission from massive machine type communications (MTC) devices within a short time period makes the traffic flow of MTC more bursty. The classic Markovian traffic models under the assumption of Poisson arrival, whose inter-arrival time (IAT) is a negative exponentially distributed random variable with infinite support range, are no longer suitable for this situation. Be...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014